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Abstract 

The evolution of genetic systems has been analyzed through the use of modifier 
gene models, in which a neutral gene is posited to control the transmission of other 
genes under selection. Analysis of modifier gene models has found the manifesta- 
tions of an "evolutionary reduction principle": in a population near equilibrium, a 
new modifier allele that scales equally all transition probabilities between different 
genotypes under selection can invade if and only if it reduces the transition prob- 
abilities. Analytical results on the reduction principle have always required some 
set of constraints for tractability: limitations to one or two selected loci, two alleles 
per locus, specific selection regimes or weak selection, specific genetic processes 
being modified, extreme or infinitesimal effects of the modifier allele, or tight link- 
age between modifier and selected loci. Here, I prove the reduction principle in 
the absence of any of these constraints, confirming a twenty-year old conjecture. 
The proof is obtained by a wider application of Karlin's Theorem 5.2 (1982) and 
its extension to ML-matrices, substochastic matrices, and reducible matrices. 

Keywords: evolution; evolutionary theory; modifier gene; recombination rate; mu- 
tation rate; spectral analysis; reduction principle; Karlin's theorem; ML-matrix; 
essentially non-negative matrix. 



1 Introduction 

Darwinian evolution occurs through the interaction of two fundamental processes: (1) 
natural selection, i.e. differential survival and reproduction; and (2) genetic transforma- 
tion, i.e. change of genetic content during reproduction, which provides the variation 
upon which selection can act. The principal genetic transformations are sexual repro- 
duction, recombination, and mutation, while a growing list of other transformations 
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includes gene conversion, methylation, deletions, duplications, insertions, transposi- 
tions, and other chromosomal alterations. 

These two processes, augmented by a third — the randomness of sampling in finite 
populations — provide the basis for our causal explanations of the characteristics of 
organisms. In its simplest version: Transformation processes create new genetic states 
among offspring, and differential survival and reproduction of alternate genetic states 
results in the prevalence of states with the highest levels of survival and reproduction. 
In a more sophisticated version: evolutionary dynamics are the result of the joint action 
of selection, transformation, and random sampling processes that move populations to 
distribute over certain regions of genotype space. 

The simpler version of the explanation runs into a quandary when trying to explain 
traits that in themselves do not cause different levels of survival or reproduction, in 
particular, the traits that make up the molecular, cellular, and organismal machinery 
of the genetic transformation processes themselves. Variation in the transformation 
machinery can produce differing distributions of offspring genotypes, without neces- 
sarily affecting the parent's survival or their quantity of offspring. To understand the 
fate of variation in the genetic transformation machinery requires the more sophisti- 
cated version of evolutionary causation, the evolutionary dynamics of the joint action 
of selection, transformation, and sampling processes. 

The earliest mathematical treatments of evolutionary dynamics in t he 1920s 'Mod- 
ern Synthesis ' of Darwinism and Mendelian genetics (Fisher, 1922; Haldane, 1924; 



Wright, 1931) dealt straightaway with many complex issues; however, the first analysis 



of genetic variation in the genetic transformation processes themselves waited another 
thirty years for Kimura's (1956) analysis of a model of recombination modification. 
The model examines the fate of a chromosomal alteration that eliminates recombina- 
tion between two loci with a stable polymorphism that exhibits linkage disequilibrium. 
When the alteration occurs in the chromosome that have above average fitness, it in- 
creases in frequency. This result affirmed Fisher's dl930l p. 130) assertion that "the 
presence of pairs of factors in the same chromosome, the selective advantage of each of 
which reverses that of the other, will always tend to diminish recombination, and there- 
fore to increase the intensity of linkage in the chromosomes of that species." This result 
was the first instance of what was t o later be called the "reduction principle" for the 
evolution of genetic transformations ( Feldmanl 19721; Feldman et al. , 1980; Altenberg, 



1984; Liberman and Feldman, 1986b; Altenberg and Feldman, 1987). 



b; 



Nei ( 19671) introduced, and partially analyzed, a model for the evolution of recom- 
binatio n in which reco mbination rates between two loci are modified by a third, neutral 
locus. Feldmanl ( 1972 ) gave a complete linear stability analysis of the model under the 
assumption of additive, multiplicative, and symmetric viability selection regimes. He 
found that for populations near a polymorphic equilibrium with linkage disequilibrium 
under selection and recombination, genetic variation for recombination would survive 
if and only if it reduced the rate of recombination between the loci under selection. 

Other transformation processes were analyzed with modifier gene models, in par- 
ticular mutation and migration rate s , and the reduction principle was found to emerge 



again (Karlin and McGregor 



1973HKarlin and McGregoi 



19721: iFeldman and Balkau[Jl973t Balkau and Feldman, 
974t ll 



1974 lFeldman and Krakauer CTH^ lFeldman et allll980l) 



(Note that there is more recent acceptance of the idea that spatial location of organisms, 
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and other environmental condi tions, may function formally as heritable trai ts subject to 
transformation processes, e.g. ISchauber et al.l (I2007h : l0dling-Smeel d2007[). However, 
this concept follows quite naturally from the concept of gene ralized transmission that 
appears in models of cultural transmission and modi fier genes (ICavalli-Sforza and Feldman 
19731 iKarlin and McGregorll 974 lAltenberdl 1984 pp. 15-16, p. 178).) 

These first reduction results were derived under narrow constraints on the selec- 
tion regime, number of modifier alleles, or number of alleles under selection. Sub- 
sequent studies of modifiers of these three processes — recombination, mutation, and 
migration — have ext ended the result to cases of modi fi er polymorphisms and arbitrary 
selection coefficients ( Liberm an and Feldman , Il986albt Feldman and Liberman, 1986; 



Liberman and FeldmanT 1989b . but are still restricted to two alleles per selected locus 
or two demes in the case of migration modification. 

The repeated emergence of the reduction result in modifier models of different 
process es led to a stud y of mathematical underpinnings that might be common to all of 
them (I Altenbergt 1 1 9 84) . The approach taken was to represent all possible transmission 
processes — in which recombination and mutation comprise special cases — using 
a general bi-parental transmission matrix, consisting of probabilities T(i <— j, k) that 
parental haplotypes j and k produce a gamete haplotype i. Haplotype here refers to a 
gamete's genotype, or a gamete's contribution to a diploid genotype. (The addition of 
spatial subdivisi ons in the case of migration modification leads to a slightly changed 
representation (Altenbergl ll984 pp. 178-199).) 

This approach allows the particulars of the processes to be abstracted out of the 
model, and reveals that what all the models have in common is the nature of the varia- 
tion in transmission produced by variation at the modifier locus. All of the models fit 
the form 

T a (i^~j, k) = a P(i<-j, k) for j, k ^ i, 

where a is the modified parameter that represents an overall rate of transf ormation of 
haplotypes, over all haplotypes. This form is referred to as linear variation ( Altenbergl 
1984; Alte nberg and Feld man. 119871) because the modifier gene scales all transmission 
probabilities between different haplotypes equally. 

In the perturbation analysis of the evolutionary dynamics of the modifier locus (to 
be explained in detail in the next section), the stability matrix under linear variation has 
the form: 



M(a,r)D = {(1 - - r)I + rQ] + - r)S + rS M ]} D, 



(1) 



where a is the transmission parameter produced by the new modifier allele, Q, S, 
and S n are stochastic matrices, D is a positive diagonal matrix, and r is the rate of 
recombination between the modifier and the nearest locus under selection. 

The rare modifier allele will increase when rare at a geometric rate if and only if the 
spectral radius p(M(a, r)D) exceeds 1 . This formulation is extremely general, accom- 
modating modifier models with arbitrary numbers of modifier alleles, numbers of loci 
and alleles per locus under selection, selection regimes, and transmission processes. 
Although the model is extremely general, it is tractable thanks to a theorem of 



Karlin<1982): 
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Theorem 1 (Karlin Theorem 5.2, (119821 pp. 194-196)). Let M be an arbitrary non- 
negative irreducible stochastic matrix. Consider the family of matrices 



M(a) = (l-a)I + aM 
Then for any diagonal matrix D with positive terms on the diagonal, the spectral radius 

p(a) = p(M(a)D) 
is decreasing as a increases (strictly provided D ^ dl). 

We see that M(a, r)D fits the form in Karlin's theorem only if r = 0, in which 
case the stability matrix becomes {(1 — a)I + aS}D, and we immediately see that 
a new modifier can invade a populatio n near equilib rium if and only if it reduces 
a so as to produce p(M(a)p ) > 1 dAltenbergfl 19841 Theorem 3.9, pp. 126-128; 
Altenberg and Feldmanll 19871 Result 3). If r > 0, one is left to evaluate 



-p([(l-a)Mi + oMj]D), (2) 

da 

where Mi, M2 ^ I. A closed-form characterization of stochastic matrices Mi and 
M2 that produce dp /da < is not readily obtained. 

The case of general r > would seem unlikely to reverse the reduction result, since 
the only action of recombination with the modifier locus is to blend the equilibrium 
distribution of selected haplotypes with the distribution created by the new modifier 
allele. This blending should not alter the tendency of modifier alleles that reduce the 
transformation rate to become associated with haplotypes of above-average fitness (and 
the converse), which is the essence of the dynamics. The blending would merely lessen 
the association. Even free recombination cannot completely eliminate this emergent 
association: a modifier allele that produces perfect transmission always inva des when 
introd uced, even if unlink ed to the selected loci, i.e. g(M (0, 1/2) D) > 1 dAltenbergl 



1984 Theorem 3.5, p.l 18; Altenberg and Feldman 1987, Result 2). 

Thus it was conjectured in Altenberg and Feldman (1987) that for arbitrary r £ 
[0, 1/2], a new modifier allele can invade if and only if it decreases a. This paper 
proves the conjecture through an extension and expanded use of Karlin's theorem. The 
proof requires the extension of Karlin's theorem to essentially non-negative matrices 
(ML matrices), and to reducible matrices. The last of the constraints needed to prove 
the general reduction principle is thus removed. 

I prefa ce the result with a self-contained review of the g eneral modifier model de- 



velope d in lAltenberg ( 1984); Altenberg and Feldm an ( 1987) and also used in lZhivotovskv et al 



(1994). 



2 The Model 

The evolutionary model examined here fits the general form 



w z'i = }T(i*-j, k) w jk Zj z k , (3) 

jk 
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Figure 1: The configuration of loci in the modifier gene model considered here. M a 
and Mb are the modifier alleles, Gj and Gk are the haplotypes undergoing viability 
selection, consisting of genes Gj 1 ■ ■ ■ Gj n and Gkj ■ ■ ■ Gk n - With probabilities under 
the control of the modifier genotype M a /Mb, haplotype M a Gi is produced. 



where zi is the frequency of haplotype i in the population, z[ is the frequency in the 
next generation, T(i <— j, k) is the probability that parental haplotypes j and k pro- 
duce an offspring haplotype i, Wjk = Wkj is the fitness of diploid genotype jk, and 
w = J2j k Wjk Zj Zk is the mean fitness of the population. The model includes the gen- 
eral assumptions of an infinite population, frequency-independent viability selection, 
random mating, sex symmetry, no sex linkage, and non-overlapping generations. 

The modifier gene model is a special case of (01 in which the genome is structured 
to contain a group of loci under selection, and a neutral locus external to the group 
that modifies their genetic transmission probabilities. The structure is illustrated in 
Fig. Q] Haplotypes will now have two indices, one for the allele at the modifier locus 
(a, b, c etc.), and one for the haplotype of the selected loci k etc.). The modifier 
allele is assumed to be transmitted perfectly (no mutation nor segregation distortion), 
so that the only force acting upon it arises from its association with the selected loci. 
Recombination between the modifier locus and the nearest selected locus occurs at rate 

Tab- 

The situation where the modifier locus is internal to the set of selected loci is not 
considered because it is no longer possible to separate recombination events with the 
modifier locus from transformations acting on the selected haplotype. In order for 
the variation in transmission to be linear in this configuration, there would need to 
be complete interference between recombination events on either side of the modifier 
locus and any other transformation processes being modeled. 
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2.1 The Modifier Gene Model 

Under this genetic structure, the transmission probabilities take on the form: 

T( r )(ai <— aj\bk) := (1 — r ab )T(ai^aj\bk) + r ab T K (ai<—ak\bj) 

where the probability that parental genotype aj, bk produces gamete haplotype ai, con- 
ditioned on the offspring containing modifier allele a, is: 

T(ai*—aj\bk), when no recombination occurs between the modifier and nearest se- 
lected locus, and 

T M (az<— ak\bj), when recombination occurs between the modifier and nearest se- 
lected locus (hence aj\bk becomes ak\bj). 

So, 1 = T(ai^aj\bk) = £V T M [ai < ak\bj) 1 Vo, b, j, k. 

2.2 Dynamical Recursion for the Modifier Model 

With this genetic structure, recursion Q becomes: 



where 

z a i is the frequency of the haplotype with allele a at the modifier locus, and haplotype 
i at the selected loci, and z' ai is for the next generation; 

Wjk is the fitness coefficient for diploid genotype jk at the loci under selection; 

w := J^abjk w jk z aj z bk is the mean fitness of the population, 

The evolutionary analysis consists of asking how the values of Ti T \{ai <— aj\bk) 
determine whether a modifier allele a can invade a population and be protected from 
extinction. 

3 Equilibria and their Stability 

A population at equilibrium under (0]l must satisfy the constraint: 




(4) 



bjk 




(5) 



cjk 



A perturbation of the equilibrium to zu = zu + e b i produces: 



w + 2 ^ tbjWj k Zck + ^ e bj e ck (z bi + e' bi ) 

bjck bjck J 



= )]T( r ) (bi<-bj\ck) w ]k (z bj + e bj ) (z ck + e ck ). 

cjk 
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The long term evolution of genetic transmission depends on the properties that 
allow a new modifier allele to invade a population and be protected from extinction. 
Hence the analysis focuses on perturbations of the equilibrium by rare modifier alleles, 
entailing z a i = for all i for new modifier allele a. Making this substitution, and 
ignoring all second and higher order terms in the perturbation, the linear recursion on 
a new modifier allele, a, that perturbs (0 can be represented in vector form as: 



< = M D e a 

where M is a stochastic matrix, D a positive diagonal matrix, and 



(6) 



[e oi ]™ =1 , D := diag m/w 



M 



Ebfc T (r) (ai*-aj\bk)^z bk 



bj w ij z bj- 



and 



Modifier allele a will increase at a geometric rate when rare if and only if the spectral 
radius p(MD) exceeds 1. Clearly, if D = I, then p(MD) = p(M) = 1, so geometric 
rates of change in modifier allele frequencies require D ^ I, a situation described 
by saying there is a positive s election potential dAltenberglll984 "fitness load" p. 63; 
Altenberg and Feldmanlll987l) : 



V 



1 > 0. 



(7) 



The analysis consists of evaluating the relationship between Tr r )(ai *— aj\bk) and 
p(MD). 

4 Variation in Transmission 



The reduction principle emerges in models where the modifier gene scales all the tran- 
sition probabilities between different genotypes equally. Variation in transmission falls 
along a line that intersects the matrix for perfect transmission (where parental hap- 
lotypes are transmitted unchanged to their gametes, and in equal proportions), which 
functions as an "origin" in t his space of matrices. For thi s reason it is called "linear 
variation" dAltenbera,ll984tlArtenberg and Feldmarl ll987). 

A mechanistic derivation of linear variation is that each selected haplotype j has a 
certain probability, a, of being "hit" by some transforming process, and given that it is 
hit, it is transformed into various other selected haplotypes with different probabilities. 
These probabilities, T(i *—j\k), include possible dependence on both parental haplo- 
types j and k. When the effect of the modifier gene is to scale the "hit" rate, a, up or 
down equally for all haplotypes, it produces linear variation in transmission. 



4.1 Perfect Transmission 

A genetic system that perfectly transmits parental selected haplotypes to gametes can 
be represented by: 



Tfa(ai<-aj\bk) = T^(ai<—aj\bk) = S. Lj , 



(8) 
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where 5ij = 1 if i = j and otherwise. Any genetic system can be characterized by 
the probability with which perfect transmission occurs. For a given modifier genotype 
ab, a global characterization of the extent of perfect transmission can be captured by 
a lower bound, 1 — a ab , on the probability of perfect transmission over all selected 
haplotypes, where 

1 - a ab := min {T(ai^ai\bk),T t< (ai^ai\bk)} G [0, 1]. (9) 

This lower bound can be used to parameterize the transmission probabilities: 

T(ai^aj\bk) = (1 — a ab ) 5ij + a ab P(ai<—aj\bk) (10) 

and 

T™(ai<—aj\bk) = (1 - a ab ) % + a ab P M (ai <- aj\bk) , (11) 

Thus P(ai <— aj\bk) > and P M (ai <— aj\bk) > for all i,j, k, and for some i, k, 
P(ai^ai\bk) = or P™ (ai <- ai\bk) = 0. 

Linear variation is defined to be where the sole effect of the modifier locus is to 
determine a: 

T(ai^aj\bk) = (1 - a ab ) % + a ab P(i<—j\k) (12) 

and 

T*(ai^ak\bj) = (1 - a ab ) 5 lk + a ab P»{i<-k\j). (13) 



4.2 External Stability of a Modifier Monomorphism 

For the purposes of brevity, it becomes useful at this point to restrict the analysis to 
populations that are initially fixed on a single modifier allele. The steps that follow 
apply to modifier polymorphisms, but require additional characterizations of the poly- 

morphisms; mod ifier polymorphisms under generalized transmission are analyzed in 

lAltenberg|(ll984. and for particula r cases of transmission processes , inlLiberman and Feldmanl 
dl986allbl) : Feldman and Liberman Jl986l) : lLiberman and Feldmanl dl989l) . 

The population will initially be at equilibrium, fixed on modifier allele b where 
a bb > 0. Haplotypes will be listed only if they occur with positive frequency, so 
z b i > for all i at equilibrium. To this population at equilibrium, modifier allele 
a is introduced. Linear variation and a bb > preclude the possibility that the new 
modifier allele will cause the production of selected haplotypes not already present in 
the population. 

For readability, let a represent a bb , a represent a ab , and r represent r ab . For linear 
variation (fT2l([T3T>, the stability matrix M in © can be expressed as a function of a and 

r: 



M(a,r) 



ET (r) (ai <- aj | bk)^^z bk 



(14) 



(1 - a)[(l-r)I + rQ^ a[(l-r)S + rS% 
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where 



5> 



Wjk - 



Q 



S := 



> , o ik ——z hk 



- n 








- i,i=l 







111, 



52p(i<-j\k) 

k 

£p"(i«-*ii)^ 



and 



'~bk- 



'■j = l 



Letting e be the vector of ones, and e T its transpose, note that 

e T Q = e T S = e T S* = e T , 



(15) 



since J2i P{i^j\ k ) = Y.i P M {i^k\j) = J2 k WjkZbk/wj = 1- M(a,r) can be used 
to express the equilibrium constraint ©: 



z b = M(d,r)Dz 6 

= {(1 - &)[(l-r)I + rQ] + &[(l-r)S + rS*]} Dz & . 



(16) 



5 Results 



The strategy for proving the reduction result for arbitrary r is to try to allow the im- 
mediate application of Karlin's Theorem 5.2 by representing M(a, r) as M(a, r) = 
(1 — 0)1 + j3Ad, for a choice of Ad where we know p(AiD). Finding an irreducible 
stochastic such Ad would give 



and 



p(M(a, r)D) > p(MD) for < < 1, 

p(M(a, r)D) < p(MD) for /3 > 1. 

For our choice of Ad, we possess the crucial fact from ( fT6l ) that p(M(a, r)D) = 1, 
so we might hope to find (3 and r to give (1 — (3)1 + /3M(a, r) = M(a, r), but this is 
unworkable. Fortunately, there exists a matrix solution M(a, r) = (1 — /3)I + @Ad*, 
but Al* may have negative elements along the diagonal f or certa i n co mbinations of a, 
a, and r. Suc h a matr i x is k nown as a 'Metzler-Leontief dSenetalll98ll) or 'essentially 
nonnegative' dCohenl 1 198 ID matrix. An extension of Karlin's Theorem 5.2 to ML- 
matrices is provided which allows proof for all a, a, and r. We proceed in stages. 
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5.1 Metzler-Leontief (ML) Matrices 

An ML-matrix is a square real matrix where all non-diagonal elements are non-negative. 
The spectral abscissa of a matrix A is defined as tt(A) = ruax.j{Re(Aj)}, where {Xi} 
are the eigenvalues of A. Let B refer to an ML-matrix, where bij > for all i ^ j. 
We utilize the following properties of ML-matrices: 

Lemma 1 (Spectral Abscissa of ML Matrices). 

1. 7r(B) is an eigenvalue o/B, referred to as the Perron root as it is for positive 
matrices; 

2. 7r(B) = /?(B + gT) — g, for any g > — mini {bu}; 

3. If Hz = AB and z > 0, then A = 7r(B). If in addition B is irreducible, then 
there also exists v > such that v T B = 7r(B)v T , and z and v are unique up 
to constant multiples. 



Proof. These are found in, or follo w directly from, Theorem 2.6 in (Seneta, 19811 pp. 



45^6) and Theorem 3 in dGantma cher, 1959, p. 66). □ 



5.2 Solving for M* 

Lemma 2. Define 

M{a, ci, c 2 ) := (1 - a) [(1 — ci) I + ciQ] + a[(l - c 2 )S + c 2 S N ]. 
Then for a, a E (0, 1), and r E [0, 1]: 

1. M(a,r) = (1 -/3)I + /3M(a,ci,C2) is solved by 

(3 = a/a, c\— rd(l ~ a)/[a(l — d)], ant/ c 2 = r, 



fo give 



where 



M 



(a,r)=(l-^)l+?M* ) (17) 
V a) a 



M* := Mfd,r Q {) a) r) (18) 
V a(l-a) / 

1- d-r-+rd ) I + rd ( - - 1 ) Q + d[(l - r)S + rS"]. 
a J \a J 

2. e T A / t((i, Ci, r) = e T ant/ Al(d, c 1; r) D z^ = zj /or any Ci E 5R. 

3. ' an ML-matrix and, if (1 — d)rQ + d[(l — r)S + rS™] is irreducible, 
tt(M*D)=L 

4. If, in addition, a > a or a > a r /[l — d(l — r)], then Al* is non-negative and 
7r(M*D)=p(M*D) = l. 
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Proof. Straightforward evaluation verifies (T% and ( fl~8T > from 

d(l — a) 



(1-d) 



a(l — d) 
a [(l-r)S + rS M ]. 



a(l — a) „ 
r— ^-Q 

a(l — a) 



BydBJl, e T M(a,ci,c 2 ) = [(1 - d)(l - ci +ci) + d(l - c 2 + c 2 )]e T = e T . Observe 
that IDz 6 = QDz 6 = Dz fe : 



QDz h = 



Wit 
w j 



-Zbi 



- n 




diag 


- i)j'=l 





/ ~-' w 



Hence ci, r) D z;, 



- n 




n 


- *)i=i 


— Zbi 

w 


= Dz 5 


!]+d[(l- 


r)S + 


rS M ]Dz 6 


,r) :=M( 


a,r), then 


r, r)Dzt = 


M(a, 


r)Dz b = z b . 



(19) 

The off-diagonal elements in ( fT~8b are non-negative as they all derive from non- 
negative stochastic matrices — Q, S, and S M — multiplied by non-negative coefficients 
under conditions d E (0, 1), c\ > 0, and r € [0, 1]. 

If (1 — a)rQ + d[(l — r)S + rS M ] is irreducible, then since Zf, is strictly positive, 
Zfc is the right Perron eigenvector of Ad(a, C\, r)D with Perron root 1, by LemmaQ] 
((3). The coefficient on I, 1 — d — ra/a + ret, is non-negative when a > d, for then 
1 — a — ra/a + ra > (1 — d)(l — r) > 0, but is negative if a < a r/[l — d(l — r)] 
(found by simple rearrangement). Thence M* may have negative diagonal elements. 
When M* is non-negative, then 7r(A^* D) = p{M* D). □ 

5.3 Extending Karlin's Theorem 5.2 to ML-Matrices 



Friedland 's dl98lb Donsker- Varadhan related variational formula for the spectral radius 
is applied to A > in order to show that it applies also to the Perron root of B = 

Lemma 3 (Variational Formula for ML-Matrices). For any irreducible ML-matrix B, 
the spectral abscissa is 



7r(B) = sup mi > pi 

1 4 — 1 



(20) 



where Vn = {p: p > 0,e T p = 1}. 
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Proof. Define for any matrix A, 



/(A,p,x) -=^2p: 



[Ax], 



(21) 



i=l 



The Donsker-Varadhan related variational formula (lFriedland[[l98ll Corollary 3.1) for 
irreducible A > gives us p(A) — sup peV inf x> o /(A, p, x). Substituting A = 
B+gl. 



p(A) = p(B+gI)= sup inf VV 

„CD X>0 ' ' 



[(B + 0l)x]i 



sup inf > p, 



gp n x>C 

[Bx]j 



Knowing from Lemma[T|([2]) that p(A) = 7r(B) + g proves the lemma. Moreover, 

sup inf /(A,p,x) 
pgp„ x>0 



and 



sup inf /(B,p,x) 

p£T>„ x >° 



are both attained for the same p and x. 



□ 



Lemma 4 (Derivative of the Spectral Abscissa). Let B(/3) be a function of 13 £ 5R, in 
continuity class C 2 , such that B(/3) is an ML-matrix for (3 > 0. Lef p(/?) fln d x (/3) f x 
normalized so e T x(/3) = 1) fee f/ze vectors af which the supremum and the infimum in 
d20t , are attained, respectively. Then 



d/3 



7r(B(/3)) = £>(/3)- 



r dB(/3) 

d/3 



i=l 



Si 09) 



Proof. Let A(/3) = B(/3) + gl > be a non-negative matrix associated with B(/3). 
So ^A(/3) = ^B(f3). By Lemmad©, ^p(A(/3)) = ^tt(B(/3)). Differentiating 

(ED: $/(A,p,x) = 

<9/(A,p,x)dA 9/(A,p,x)dp <9/(A,p,x)dx 



dA 



d/3 



dp d/3 



<3x 



do 



Since p(/3) and x(/3) are unique cri tical points of /(A(/3), p(/3), x(/3)) dFriedland and Karliii . 
19751: Friedlandl fl98ll iKarlinl [l982h : 



<9/(A,p,x) dp 



dp dp 



MP), 

p(/3),x(/3) 



<9/(A,p,x) dx 



9x 



d/3 



A(/3), - °> 

p(/3),x(/3) 
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hence 



><*> - ><«> - 9JS &*w 



A(/3), 
p(/3),x(/3) 



X>(/3) 



= 5>(/3) 



a:* 03) 



□ 

Theorem 2 (Extension to ML-Matrices). Let L fee an irreducible ML-matrix such that 
e T L = e T . Consider the family of matrices 

C(P) = (1-/3)1 + PL 

with f3 > 0. Then for any diagonal matrix D nof a multiple of I, with positive terms on 
the diagonal, the spectral abscissa and Perron root 

7r(CG9)D) 

is strictly decreasing as [3 increases. 

Proof. Let B(/3) = C(/3)D. Applying Lemma|4]we have: 



-4C(/?)D) = 



since ^B(/3) = ^|C(/3) D = (L - I jD. 

Following Karlin's proof in iKarli nidi 9821) . we note that for (3 > 0, 



Hence 

rf7T(C(/?)D) 

dp 



L - 1 = i ([(1 - /3)I + /3L] - 1) = i[CG9) - I]. 



If ^ J(C(/3)-I)Dx(/3)] t _ 1 



tt(C(/?)D)-£>(/3) 



7r(C(/3)D)-Epi(/3)D, 



(22) 



As x(/3) is unique and produces the infimum, for < x ^ x(/3): 
f [C(^| " [C(/3)Dx]. 
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Let y > be the right Perron eigenvector of C(/3). Since e T L = e T C(/3) = e T , 
we know that 7r(C(/3)) = 1, hence C(/3) y = y. Now, set x = D _1 y, where y is 
scaled so that e T x = 1. Assuming D _1 y ^ x(/3), we get: 



(23) 



< 



!>(/?) 



X>(/?) 



We verify that D _1 y 7^ x(/3): should D _1 y = x(/3), then (1231 is an equality, thence 
7r(C(/3)D) = suppg^ = maxiDii. The supremum requires Pi((3) = for 

all {i: D u < maxj which is no nempty because D 7^ dl for any d. In con- 

tradiction, irreducible L implies p > (IFriedlandill98U eq. (3.5), Theorem 3.2, and 
Corollary 3.1). 

Application to d22l gives: 



d/3 



tt(C(/3) 



D) = i ^r(C(/3)D) - f>(/3)D^ < 0. 



□ 



Corollary 1 (Extension to Substochastic-ML Matrices). Let L fee an irreducible ML- 
matrix such that e T L < e T and e T L 7^ e T . By extension from non-negative matrices 
this will be referred to as a substochastic-ML matrix. Consider the family of matrices 
C(/3) = (1— L with f3 > 0. Then for any diagonal matrix D with positive terms 
on the diagonal, the Perron root and spectral abscissa 7r(C(/3)D) is strictly decreasing 
as P increases. 

Proof. The steps are identical to those in the proof o f Theor em[2]except that e T L <^ 



i0e 



e 1 means e 1 C(j3) <^ e 1 , hence tt(C(/3)) < 1 (ISenetalll98ll Corollary 3, p. 30; 
Corollary 3, p. 52 and Lemma [T]©). No assumption that D _1 y 7^ x(/3) is needed, 
and ( 1231 becomes: 

tt(CG8)D) < E ft (/3) ^M^ = 7r(C(/3))][>(/3K < f>W- 
Hence from (|22|, 4^7r(C(/3)D) < 0. No assumption D 7^ dl is used. □ 



5.4 Main Result 

Theorem 3 (The Reduction Principle for Linear Variation). Suppose a new allele at 
a modifier locus is introduced into a population fixed at the modifier locus near an 
equilibrium with a positive selection potential among the selected haplotypes, and a < 
1. The new modifier allele produces linear variation in the transmission of the selected 
loci, and is linked to the nearest selected locus with recombination rate r. 
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If M(a, r) is irreducible, the new modifier allele will increase (decrease) in fre- 
quency at a geometric rate if it brings the transmission closer to (further away from) 
perfect transmission, i.e., a < a (a > a). 

Tjf M(a, r) is reducible, then: 

1. the new modifier allele will increase in frequency at a geometric rate if a < a, 
provided it occurs in at least one haplotype whose marginal fitness differs from 
the mean fitness of the population; and if a > a, it will either: 

2. decrease in frequency at a geometric rate; or 

3. not increase in the case where it occurs in a haplotype whose block in the Frobe- 
nius normal form o/M(a,r) is an isolated block with all marginal fitnesses 
equal. 



Proof. For a = 0, the theorem reduces to Result 2 in lAltenberg and Feldmanl ( 1987). 
Henceforth only a > is considered. Lemma|2] shows that M.* is an ML-matrix for 
all a, a € (0,l),andr E [0,1]. We substitute C(/3) = (1-/3)I+/3M* and /3 = a/a, 
thusC(a/a) = M(a,r). 

Case 1: Irreducible M(a,r)D. 

If M(a, r) is irreducible and D positive on the diagonal, M(a, r)D is irreducible, 
and Theorem|2]applies. Hence for (3 > 0, ^7r(C(/3)D) < 0. Therefore, 

7r(C(a/a)D) > tt(C(1)D) = 1, for a < a, 

and 

7r(C(a/d)D) < tt(C(1)D) = 1, for a > a. 
Since M(a. r) is always non-negative, the Perron root is in fact the spectral radius: 

7r(C(a/d)D) = vr(M(a, r)D) = p(M(a, r)D). 

For a = 1 > a, M* is unsolvable, but as a limit for a < 1, p(M(a, r)D) > 1 is 
assured. 

Case 2: Reducible M(a, r)D. 

If M(a, r) is reducible or D is not strictly positive on the diagonal, M(a, r)D is 
reducible, and Theorem|2]must be extended. To analyze the dependence of the spectral 
abscissa on a in the case of reducible C(/3) or where some = 0, we utilize the 
Frobenius normal form of C(/3)D. The Frobenius normal form permutes the indices 
of C(/3)D so that along the diagonal are irreducible square block matrices C^(/3)D/ l , 
where h indexes the diagonal blocks. 

The Frobenius normal form, A, of a reducible matrix has the structure ( Gantmacher. 
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19591 p. 75): 










A 2 



A*+i, 



k-t+1,2 



\ A i+Sj i A 





A, 



t+s,2 " ' ' -f*-t+s,t 







At+i 



M+s,t + l 






A t+S 



(24) 



Blocks Ai, . . . , A t are referred to as isolated blocks, while A t+1 , . . . , A t+S are re- 
ferred to as non-isolated blocks. Isolation is defined in terms of the rows, so in a 
matrix of the form 

Ai 

A 3 A 2 

the block Ai is defined as isolated, whereas A 2 is non-isolated if A3 ^ 0. Thus, by 
definition, block Ai in Frobenius normal form is always isolated. 
At this point we require the following lemma: 

Lemma 5. The eigenvalues of A. are the eigenvalues of the irreducible diagonal block 
matrices in the Frobenius normal form of A. 

Proof. Let h £ I refers to isolated blocks, and h £ JV refers to non-isolated blocks. 
The non-isolated blocks are contained in a principal submatrix, Ajv% of A. In d24i > this 
would be 

/ A t+ i ... 



Ajv 



t+s,t+l 




(25) 



The eigenvalues of A must be eigenvalues either of an isolated block A^, h £ T, 
or of the submatrix of non-isolated blocks, Ajv". This is readily seen, because, letting x 
be an eigenvector of A, if Ax = A x, then for isolated block h, A^x/, = A x/j, where 
x/j refers to the subvector of x with indices in block h. Hence either 

1. A is an eigenvalue of Aft ; or 

2. Xft = 0. If Xh = for all h S 1, then A^x^v = A x^/ - , hence A is an eigenvalue 
of A^v". 

In submatrix Aj\f, we see from d25l > that A t+ i is an isolated block. Hence, repeat- 
ing the argument above, the eigenvalues of A^ must be eigenvalues of either A t+ i or 
of the principal submatrix containing blocks A t+2 through A t + X . Continuing recur- 
sively in this manner, the lemma is proved. □ 
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Thus the eigenvalues of C(/3)D — including the spectral abscissa — must be 
eigenvalues of the diagonal blocks C; l (/3)D/ l . Therefore, the spectral abscissa for 
C(/3)D is the maximum of the spectral abscissas: 

7r(C(/3)D)=max7r(C ft (/3)D h ). 

h 

The spectral abscissas of each block are either constant, or strictly decreasing: 

1. Constant if ~D h = d h I h and eJC h (/3) = e,[; 

2. Strictly decreasing if either 

(a) ^ d Ih (by Theorem|2]i, or 

(b) C h (P) is substochastic-ML, i.e. e^Cfe(/3) < ej, ^ (by Corollary[T|i- 

In the case where all blocks are isolated, it is not possible for all blocks to be 
constant, as this would entail that D — I, which is contrary to hypothesis (|7). In 
the case where there is at least one non-isolated block, then (referring to the indices 
in d24l >) there is at least one non-zero matrix Ct+kj, for some t € {1, . . . , s}, and 
j € {1, ... , t}. This entails that 

eJ +k C t+kJ ((3) + eJCjifi) < ej, + ej 

hence ejCj(f3) < ej , ^ ej, thus Cj((3) is a substochastic-ML matrix. Hence if TV is 
nonempty, at least one isolated block will be an irreducible substochastic-ML matrix, 
which by Corollary Q] is a decreasing block. Therefore, in all cases, there will be at 
least one isolated block h for which C/ l (/3)D/ l is strictly decreasing in (3. 

We have as a reference point dT6b . where for (3=1, C(l)Dzj , = Zf, > 0. Hence, 
by Lemma Q] ©, 7r(C(l)D) = 1. We know by Theorem 6 in dGantmacherl 1959 



pp. 77-78), extended to ML-matrices by Lemma Q]©, that n(Ch(l)T>h) = 1 for all 
feel, and 7r(C ft (l)D ft ) < 1 for all h £ TV. 

For the parameter range < j3 = a/ a < 1, i.e. a < a, at least one isolated 
block has spectral abscissa strictly decreasing in (3, thus n(Cf l (j3)'Df l ) > 1 for j3 < 1. 
Hence, 

7r(C(a/d)D) = p(M(a, r)D) = max7r(C, ! .(a/d)D /l ) > 1, for a < a. (26) 

h 

As long as the new modifier allele a is introduced into haplotypes that are part of the 
isolated block for which Tt(Ch(a/a)T)h) > 1, i.e e ah ^ 0, then e a will grow at a 
geometric rate. This proves conclusion (fl]i in the theorem. 

For the parameter range (3 — a /a > 1, i.e. a > a, if there are no isolated blocks 
for which D h = d h I h , then 7r(C/ 1 (/3)D^) < 1, for all h e 1 when (3 < I. And 
recalling that 7r(C h (l)D h ) < 1 for all ft, G TV, then 7 r(C, i (/3)D h ) < 1 for all h € TV 
when f3 > 1 since all blocks are either constant or decreasing. Therefore, 

7r(C(a/&)D) = p(M(a,r)D) = max 7 r(C ?l (/3)D; l ) < 1, for (3 > 1, 

h 
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yielding conclusion (O in the theorem. If there is a constant isolated block, which 
requires T> h = d h l h , then tt(C(/3)D) = p{M{a,r)T>) = 1 for f3 > 1. If the new 
modifier allele is introduced associated with haplotypes within the constant block, then 
conclusion (01 in the theorem holds. □ 

Conditions for M(a, r) to be reducible are limited but important. For r = 0, S 
must be reducible. This is the case in general when recombination is the only transfor- 
mation acting on the selected haplotypes, and the Frobenius normal form of S consists 
of isolated irreducible blocks. If no block has all marginal fitnesses the same, then con- 
clusion (2) applies. For M(a, r) to be reducible when r > and a < 1, Q + S + S M 
must be reducible. This is precluded if all fitnesses tuy are positive, making Q > 0. 
For Q + S + S M to be reducible requires either that some haplotypes be lethal, or that 
S + S M be reducible and that specific zeros in S + S M be matched by Wij — 0, a highly 
non-generic selection regime. 



6 Discussion 



6.1 Mathematical Issues 

Theorem[3] establishes the full generality of the reduction principle for modifier genes 
that produce linear variation in genetic transmission, assuring that no exceptions to the 
evolutionary reduction of genetic transformation rates can come about due to selection 
regime, allele or locus multiplicity, different genetic processes, or recombination with 
the modifier locus, for modifiers that produce linear variation in transmission. 

In previous studie s, analytical tractability require d t radeoffs between various sim- 
plifying assumption s: Liberman and Feldman ( 1986a b); Feldm an and Liberman (1986); 
Liberman and Feldmanld 19891) allowed no more than two alleles at each selected locus, 
or two demes in the case of migration modification, in or der to obtain results for all 
r > 0. lAltenbergl d 19840 ; lAltenberg and Feldman! d 1987b required r = or a = 
in order to obtain results for arbitrary allele and locus number, genetic processes, and 
selection regimes. These tradeoff are now removed. 

In the process of proving Theorem[3] I have had to extend Karlin's Theorem 5.2 
to ML-matrices, substochastic matrices, and reducible matrices, which may have ap- 
plication outside the current context of modifier gene theory. Moreover, the treat- 
ment o f reducible matric e s here produces much s t ronge r and more detailed results 
than in lAltenberg ( 1984 ); Altenberg and Fel dman (1987), where the spectral radius 
p(M(a, 0)D was only proven to be non-increasing in j3 for reducible M(a, 0). Here, 
it is proven that p(M(a, r)D is strictly decreasing on a 6 (0, a), and strictly decreas- 
ing on a 6 (&, 1) except when the marginal fitnesses are all equal within an isolated 
block in the Frobenius normal form of M(a, r)D, in which case p(M.(a, r)D = 1. 

It should be noted that the method of proof utilized here does not allow us to say 
that -£-/?(M(a, r)D) < for r > 0. That would be a much more direct means of 
proof of Theorem [3] and would obviate the need to extend Karlin's theorem to ML- 
matrices. A proof, however, runs up against impediments at step d23l l that have not 
been surmounted. 
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Su ch a result would be necessary to say, as was done in Result 2 in lAltenberg and Fe ldman 
dl987b . that the asymptotic strength of selection either for or against a new modifier al- 
lele increases with \a — a\. Theorem [3] alone cannot not rule out the possibility that 
the function p(M(a, r) meanders non-monotonically over (0, a) or (a, 1), only that it 
never crosses the p = 1 line on those intervals. This perverse possibility is not con 



sisten t, however, with the general finding that spectral functions are convex dFriedland , 



19811) . and out of parsimony considerations, I would conjecture that for any r > 0, 



d 



p(M(a,r)D) < 0. 



6.2 Linear Variation 

Linear variation is in a sense 'impartial' in that all transitions — from advantageous to 
deleterious haplotypes, and vice versa — are scaled exactly the same. So the produc- 
tion of both advantageous as well as deleterious haplotypes is reduced by a modifier 
that reduces transformation rates. But Theorem [3] shows that a reducing modifier al- 
lele always generates its own linkage disequilibrium with advantageous haplotypes and 
thus invades the population, while a modifier allele that increases transformation rates 
generates linkage disequilibrium with disadvantageous haplotypes and thus goes ex- 
tinct. This is the essence of the reduction principle. 

Given this generality of the reduction principle for linear variation in transmission, 
we must ask why natural genetic systems in fact depart from the reduction principle 
and do not evolve to perfect transmission. The reduction principle guides us to the list 
of assumptions that must be violated in order for departures from reduction to occur. 

Removal of the near-equilibrium assumption of the model allows several classes 
of departure from reduction. Systems may be kept far from equilibrium by fluc- 
tuating selection, finite population size that produces drift, and de novo mutations. 
Recombination has been found to ev olve under a special form of cyclic selection 



(Charles worth, 1976; Hamilton, 1980), a nd under newly ch anged directional selection 
(Mavnard Smith, 1 19881; ICh arieswo rthT 1 19931: iBartonl U995). and in finite populations 



( Felsensteinl 1 1 974u Felsenstein and Yok ovamaT 19761) . Removal of the assumption of 



Mendelian segregati on of the modifier locu s, e.g. meiotic drive, allows the evolution 
of recombination (IFeldman and OttoLll99ll) . 

Removal of the assumption of linear variation has produced many examples of 
departure from reduction. Mechanistically, linear variation requires that the modifier 
controls the 'hit' rate of a transforming process acting on the selected haplotypes, and 
that this process be the only transforming process, otherwise transmission probabilities 
between different genotypes will not be scaled equally. Furthermore, for a to enter 
linearly in the recursion, the transformation process cannot allow multiple hits. 

When spelled out thus, we see that the mechanistic requirements for linear variation 
are not realized biologically. Multiple hits are the norm in chiasma formation and in 
point mutation. Multiple transformation processes are also the norm, as recombination, 
mutation, gene conversion, deletions, duplications, transpositions, etc. all happen in 
gamete formation. 

The case of multiple hits ha s been explored in a mod el of recombination modifi- 
cation for a multi-locus system dZhivotovskv et al. . 11994 . There, a refined reduction 
principle again holds. 
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The case of multiple simultaneous transformation processes acting during trans- 
mission has received a good deal of theoretical attention, and it is here that numerous 
departur es from the reduction principle have been found. A detailed review can be 
found in Feldman et al. (1997). The cases exemplify the "principal of partial control" 
( Altenberg , 1984 p. 149): when a modifier gene has only partial control over the 
transformations occurring at selected loci, then it may be possible for this part of the 
transformations to evolve an increase. 

The problem of characterizing exactly the conditions on variation in transmission 
that will evolve under modifier gene control remains an open question. Its solution 
will require a deeper understanding of the differential properties of the spectra of non- 
negative matrices. 
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